Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
En | 6679 | 168 | 3 | 56.0000 |
El | 11442 | 776 | 14 | 55.4286 |
También | 1008 | 46 | 1 | 46.0000 |
La | 8749 | 673 | 15 | 44.8667 |
Los | 3265 | 268 | 8 | 33.5000 |
Se | 1610 | 90 | 4 | 22.5000 |
Su | 572 | 40 | 2 | 20.0000 |
Para | 1162 | 60 | 3 | 20.0000 |
Pero | 1346 | 53 | 3 | 17.6667 |
Las | 1927 | 157 | 9 | 17.4444 |
No | 2052 | 85 | 5 | 17.0000 |
Es | 1739 | 50 | 3 | 16.6667 |
ciento | 586 | 16 | 1 | 16.0000 |
Entre | 457 | 16 | 1 | 16.0000 |
De | 1698 | 47 | 3 | 15.6667 |
Y | 1772 | 62 | 4 | 15.5000 |
Aunque | 386 | 15 | 1 | 15.0000 |
Al | 979 | 59 | 4 | 14.7500 |
Una | 993 | 57 | 4 | 14.2500 |
pero | 4263 | 114 | 8 | 14.2500 |
Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
millones | 1897 | 8 | 129 | 0.0620 |
vamos | 371 | 1 | 16 | 0.0625 |
grado | 103 | 1 | 12 | 0.0833 |
ola | 113 | 1 | 11 | 0.0909 |
voy | 196 | 1 | 11 | 0.0909 |
tipos | 133 | 1 | 10 | 0.1000 |
creo | 257 | 1 | 10 | 0.1000 |
van | 503 | 2 | 20 | 0.1000 |
concepto | 105 | 1 | 9 | 0.1111 |
Mexicana | 52 | 1 | 9 | 0.1111 |
estudiante | 67 | 1 | 9 | 0.1111 |
cerca | 657 | 2 | 18 | 0.1111 |
convertirse | 127 | 1 | 9 | 0.1111 |
después | 1564 | 6 | 52 | 0.1154 |
señor | 95 | 1 | 8 | 0.1250 |
canal | 125 | 1 | 8 | 0.1250 |
posibilidad | 332 | 1 | 8 | 0.1250 |
fin | 752 | 4 | 29 | 0.1379 |
cuadro | 111 | 1 | 7 | 0.1429 |
miles | 230 | 1 | 7 | 0.1429 |
In this subsection, we compute the ratio of the number of right neighbors and the number of left neighbors. Again, we look for words with extreme ratios:
Data for first table:
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II